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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 
(i) APPLICANT: 

(A) NAME: THE GOVERNMENT OF THE UNITED 
STATES OF AMERICA AS REPRESENTED BY THE 
SECRETARY, DEPARTMENT OF HEALTH AND HUMAN 
SERVICES 

(B) STREET: OFFICE OF TECHNOLOGY TRANSFER, 
SUITE 325 

6011 EXECUTIVE BOULEVARD 

( C ) CITY : ROCKVI LLE 

O (D) STATE OR PROVINCE: MARYLAND 

*0 (E) COUNTRY: UNITED STATES OF AMERICA 

TU (F) POSTAL CODE: 2 0852 

py (i) APPLICANT: 



(A) NAME: NELSON, EDWARD L. 

(B) STREET: 660 BUCKHORN ROAD 

(C) CITY: ELDERS BURG 

(D) STATE OR PROVINCE: MARYLAND 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) POSTAL CODE: 21784 



(i) APPLICANT: 

^0 (A) NAME: NELSON, PETER J. 

(B) STREET: REUTTERSTRASSE 7 0 

(C) CITY: MUNICH 

(E) COUNTRY: GERMANY 

(F) POSTAL CODE: 80689 

(ii) TITLE OF INVENTION: NOVEL VECTOR FOR 
POLYNUCLEOTIDE VACCINES 



(iii) NUMBER OF SEQUENCES: 29 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: MORGAN & FINNEGAN, L.L.P. 

(B) STREET: 34 5 PARK AVENUE 

(C) CITY: NEW YORK 

(D) STATE: NEW YORK 

(E) COUNTRY: USA 

(F) ZIP: 10154 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: DISKETTE, 3,5 INCH, 1.44 MB 

(B) COMPUTER: IBM PC COMPATIBLE 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: WORDPERFECT 5,1 
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(vi) CURRENT APPLICATION DATA: 

(A) FILING DATE: 14-AUG-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 6 0/023931 

(B) FILING DATE: 14 -AUG- 199 6 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: KATHRYN M. BROWN 

(B) REGISTRATION NUMBER: 3 4 556 

(C) REFERENCE/ DOCKET NUMBER: 2026-4236PC 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 758-4800 

(B) TELEFAX: (212) 751-6849 

(C) TELEX: 421792 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

( D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



GGCCGCGTTG CTGGCGTTTT TCCATAGGCT CCGCCCCCCT 4 0 

GACGAGCATC ACAAAAATCG ACGCTCAAGT CAGAGGTGGC 80 

GAAACCCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC 120 

TGGAAGCTCC CTCGTGCGCT CTCCTGTTCC GACCCTGCCG 160 

CTTACCGGAT ACCTCTCCGC CTTTCTCCCT TCGGGAAGCG 200 

TGGCGCTTTC TCAATGCTCA CGCIGTAGGT ATCTCAGTTC 240 

GGTGTAGGTC GTTCGCTCCA AG CTGGG CTG TGTGCACGAA 280 

CCCCCCGTIC AGCCCGACCG CTGCGCCTTA TCCGGTAACT 320 

ATCGTCTTGA GTCCAACCCG GTAAGACACG ACTTATCGCC 3 60 

ACTGGCAGCA GCCACTGGTA ACAGGATTAG CAGAGCGAGG 4 00 

TATGTAGGCG GTGCTACAGA GTTCTTGAAG TGGTGGCCTA 44 0 

ACTACGGCTA CAC 4 53 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 2: 

GTGTAGCCGT AGTTAGGCCA CCACTTCAAG AACTCTGTAG 40 

CACCGCCTAC ATACCTCGCT CTGCTAATCC TGTTACCAGT 80 

s ^ GGCTGCTGCC AGTGGCGATA AGTCGTGTCT TACCGGGTTG 120 

O GACTCAAGAC GATAGTTACC GGATAAGGCG CAGCGGTCGG 160 

!p GCTGAACGGG GGGTTCGTGC ACACAGCCCA GCTTGGAGCG 2 00 

^ AACG AC CT AC ACCGAACTGA GATACCTACA CCGTGAGCAT 24 0 

T TGAGAAAGCG CCACGCTTCC CGAAGGGAGA AAGGCGGACA 280 

HJ GGTATCCGGT AAGCGGCAGG GTCGGAACAG GAGAGCGCAC 32 0 

W GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA TCTTTATAGT 360 

O CCTGTCGGGT TTCGCCACCT CTGACTTGAG CGTCGATTTT 4 00 

PJ TGTGATGCTC GTCAGGGGGG CGGAGCCTAT GGAAAAACGC 44 0 

s CAGCAACGCG GCC 453 

□ (2) INFORMATION FOR SEQ ID NO: 3: 

;^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 base pairs 
^ (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GAATTCTTTC GGACTTTTGA AAGTGATGGT GGTGGCCGAA 4 0 

GGATTCGAAC CTTCGAAGTC GATGACGGCA GATTTAGAGT 80 

CTGCTCCCTT TGGCCGCTCG GGAACCCCAC CACGGGTAAT 12 0 

GCTTTTACTG GCCTGCTCCC TTATCGGGAA GCGGGGCGCA 160 

TCATATCAAA TGACGCGCCG CTGTAAAGTG TTACGTTGAG 200 

AAAGAATTC 210 

(2) INFORMATION FOR SEQ ID NO : 4 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 4: 

GAATTCTTTC TCAACGTAAC ACTTT AC AG C GGCGCGTCAT 4 0 

TTGATATGAT GCGCCCCGCT TCCCGATAAG GGAGCAGGCC 80 

AGTAAAAGCA TTACCCGTGG TGGGGTTCCC GAGCGGCCAA 120 

AGGGAGCAGA CTCTAAATCT GCCGTCATCG ACTTCGAAGG 160 

TTCGAATCCT TCCCCCACCA CCATCACTTT CAAAAGTCCG 2 00 
AAAGAATTC 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5: 

AATAAA 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
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ATTAAA 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 7: 
AGTAAA 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL : No 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

AAGAAC 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 



(iii) 



HYPOTHETICAL: No 
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(iv) ANTI-SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 9: 

AATACA 6 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 10: 

GCCTTAAGGG CCATATGGTG AGTGGATCCC TTGACCCCAG 4 0 

GCGGGGATGG GGAGACCTG T AGTC AG AG C CCCCGGGCAG 80 

CACAGGCCAA TGCCCGTCCT TCCCCTGCAG GATGAGTAGT 12 0 

GAGTGCCTCT CCTGGCCCTG GAAGTTGCCA CTCCAGTGCC 160 

CACCAGCCTT GTCCTAATAA AATTAAGTTG CATCATTTTG 2 00 

TCTGACTAGG TGTCCTCTAT AATATTAT 228 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

( iv ) ANTI-SENSE : No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 11: 



ATAATATTAT AG AGG AC AC C TAGTCAGAAC AAATG ATG C A 
ACTTAATTTT ATTAGGACAA GGCTGGTGGG CACTGGAGTG 
GCAACTTCCA GGGCCAGGAG AGGCACTCAC TACTCATCCT 



40 
80 
120 
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GCAGGGGAAG GACGGGCATT GGCCTGTGCT GCCCGGGGGC 160 

TCTGACTACA GGTCTCCCCC ATCCCCGCCT GGGGTCAAGG 2 00 

CATCCACTCA CCATATGGCC CTTAAGG 228 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

( iv) ANTI-SENSE : No 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CCTCGGTACC TGCCATGGCG CGGATTCTTT ATCACTGATA 40 

AGTTGGTGGA CATATTATGT TTATCAGTGA TAAAGTGTCA 80 

AGCATGACAA AGTTGCAGCC GAATACAGTG ATCCGTGCCG 12 0 

GCCCTGGACT GTTGAACGAG GTCGGCGTAG ACGGTCTGAC 160 

GACACGCAAA CTGGCGGAAC GGTTGGGGGT GCAGCAGCCG 2 00 

GCGCTTTACT GGCACTTCAG GAACAAGCGG GCGCCTTAAG 240 

GGCCATATGC CG 252 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHET I C AL : No 

( iv) ANTI-SENSE : No 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CCTCGGTACC TGCCACCATG GCGCGGATTC TTTAT 3 5 



(2) INFORMATION FOR SEQ ID NO: 14: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 14: 



CGGCATATGG CCTTAAGGCG CCCGCTTGTT CCTGAAGT 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETI CAL : No 

(iv) ANTI-SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 15: 



GCCTTAAGGG CCATATGGTG AGTGGATGCC TTGACCCCAG 
GCGGGGATGG GGGAGACCTG TAGTCAGAGC CCCCGGGCAG 
CACAGGCCAA TGCCCGTCCT TCCCCTGCAG GATGAGTAGT 
GAGTGCCTCT CCTGGCCCTG GAAGTTGCCA CTCCAGTGCC 
CACCAGCCTT GTCCTAATAA AATTAAGTTG CATCATTTTG 
TCTGACTAGG TGTCCTCTAT AATATTAT 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 



40 
80 
120 
160 
200 
228 
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(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE : No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 16: 



TGCCATGGCG 

CATATTATGT 

AGTTGCAGCC 

GTTGAACGAG 

CTGGCGGAAC 

GGCACTTCAG 

TGAGTGGATG 

TGT AG T C AG A 

CTTCCCCTGC 

TGTGACCCCT 

CCACTCCAGT 

TTGCATCATT 

TAAGCTTGAT 

AGCGGCGCGT 

AAGGGAGCAG 

CCCGAGCGGC 

TCGACTTCGA 

TTTCAAAAGT 

TAGTTAGGCC 

CATACCTCGC 

CAGTGGCGAT 

CGATAGTTAC 

GGGGTTCGTG 

CACCGAACTG 

GCCACGCTTC 

TAAGCGGCAG 

TCCAGGGGGA 

TTTCGCCACC 

CGTCAGGGGG 

GGCCGGGGGA 

GTGAGGGAGA 

AGTTTTCTTT 

GGCCAATGCT 

GATGCCCCTC 

GCAGAGGATT 

TCGCACAGCC 



CGGATTCTTT 

TTATCAGTGA 

GAATACAGTG 

GTCGGCGTAG 

GGTTGGGGGT 

GAACAAGCGG 

CCTTGACCCC 

GCCCCCGGGC 

AGTGAGTAGT 

CCCCAGTGCC 

GCCCACCAGC 

TTGTCTGACT 

ATCGAATTCT 

CATTTGATAT 

GCCAGTAAAA 

CAAAGGGAGC 

AGGTTCGAAT 

CCGAAAGAAT 

ACCACTTCAA 

TCTGCTAATC 

AAGTCGTGTC 

CGGATAAGGC 

CACACAGCCC 

AGATACCTAC 

CCGAAGGGAG 

GGTCGGAACA 

AACGCCTGGT 

TCTGACTTGA 

GCGGAGCCTA 

TCCGGAGAGC 

GACAGAGACT 

TCCGTTTTGT 

TGGTTGCTAT 

AACTGGCCCT 

CCTGCAGAGG 

TCTCCCACAG 



ATCACTGATA 

TAAAGTGTCA 

ATCCGTGCCG 

ACGGTCTGAC 

GCAGCAGCCG 

GCGCCTTAAG 

AGGCGGGGAT 

AGCACAGGCC 

GACTGCCCGG 

TCTCCTGGCC 

CTTGTCCTAA 

AGGTGTCCTC 

TTCTCAACGT 

GATGCGCCCC 

GCATTACCCG 

AGACTCTAAA 

CCTTCCCCCA 

TCCTGCAGCC 

GAACTCTGTA 

CTGTT AC C AG 

TTACCGGGTT 

GCAGCGGTCG 

AGCTTGGAGC 

AGCGTGAGCA 

AAAGGCGGAC 

GG AG AG CG C A 

ATCTTTATAG 

GCGTCGATTT 

TGGAAAAACG 

TCACTCTAGA 

CGAATTTCCG 

GCAATTTCAC 

TTTGGAAACT 

ATAAAGGGCC 

ATCAAGACAG 

GTACC 



AGTTGGTGGA 

AGCATGACAA 

G C C CTGG ACT 

GACACGCAAA 

G CG CTTT ACT 

GGCCATATGG 

GGGGGAGACC 

AATGCCCGTC 

GTGGGATCCC 

CTGGAAGTTG 

TAAAATTAAG 

TATAATATTA 

AACACTTTAC 

GCTTCCCGAT 

TGGTGGGGTT 

TCTGCCGTCA 

CCACCATCAC 

CGTGTAGCCG 

GCACCGCCTA 

TGGCTGCTGC 

GGACTCAAGA 

GGCTGAACGG 

GAACGACCTA 

TTGAGAAAGC 

AGGTATCCGG 

CGAGGGAGCT 

TCCTGTCGGG 

TTGTGATGCT 

CCAGCAACGC 

TGAGAGAGCA 

GAGCTATTTC 

TTATGATACC 

CCCCTTAGGG 

AGCCTGAGCT 

CACGTGGACC 



40 
80 
120 
160 
200 
240 
280 
320 
360 
400 
440 
480 
520 
560 
600 
640 
680 
720 
760 
800 
840 
880 
920 
960 
1000 
1040 
1080 
1120 
1160 
1200 
1240 
1280 
1320 
1360 
1400 
1425 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : unknown 
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(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 17: 



ATGAGCAAG GGCGAGGAAC TGTTCACTGG CGTGGTCCCA 4 0 

ATTCTCGTGG AACTGGATGG CGATGTGAAT GGGCACAAAT 80 

TTTCTGTCAG CGGAGAGGGT GAAGGTGATG CCACATACGG 120 

AAAGCTCACC CTGAAATTCA TCTGCACCAC TGGAAAGCTC 160 

CCTGTGCCAT GGCCAACACT GGTCACTACC TTCACCTATG 200 

GCGTGCAGTG CTTTTCCAGA TACCCAGACC ATATGAACGA 24 0 

GCATGACTTT TTCAAGAGCG CCATGCCCGA GGGCTATGTG 2 80 

CAGGAGAGAA CCATCTTTTT CAAAGATGAC GGGAACTACA 320 

AGACCCGCGC TGAAGTCAAG TTCGAAGGTG ACACCCTGGT 3 60 

GAATAGAATC GAGTTGAAGG GCATTGACTT TAAGGAAGAT 4 00 

GGAAACATTC TCGGCCACAA GCTGGAATAC AACTATAACT 44 0 

CCCACAATGT GTACATCATG GCCGACAAGC AAAAGAATGG 4 80 

CATCAAGGTC AACTTCAAGA T C AG AC AC AA CATTGAGGAT 520 

GG ATCCGTGC AGCTGGCCGA C C ATT AT C AA CAGAACACTC 560 

CAATCGGCGA CCGCCCTGTG CTCCTCCCAG ACAACAATTA 600 

CCTGTCCACC CAGTCTGCCC TGTCTAAAGA TCCCAACGAA 64 0 

AAG AG AG AC C ACATGGTCCT GCTGGAGTTT GTGACCGCTG 680 

CTGGGATCAC ACATGGCATG GACGAGCTGT ACAAGTGAGC 720 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1911 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 18: 



TATGAGCAAG GGCGAGGAAC TGTTCACTGG CGTGGTCCCA 40 

ATTCTCGTGG AACTGGATGG CGATGTGAAT GGGCACAAAT 80 

TTTCTGTCAG CGGAGAGGGT GAAGGTGATG CCACATACGG 12 0 

AAAGCTCACC CTGAAATTCA TCTGCACCAC TGGAAAGCTC 160 

CCTGTGCCAT GGCCAACACT GGTCACTACC TTCACCTATG 2 00 

GCGTGCAGTG CTTTTCCAGA TACCCAGACC ATATGAAGCA 24 0 

GCATGACTTT TTCAAGAGCG CCATGCCCGA GGGCTATGTG 280 
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CAGGAGAGAA CCATCTTTTT CAAAGATGAC GGGAACTACA 320 

AGACCCGCGC TGAAGTCAAG TTCGAAGGTG ACACCCTGGT 3 60 

GAATAGAATC GAGTTGAAGG GCATTGACTT TAAGGAAGAT 4 00 

GGAAACATTC TCGGCCACAA GCTGGAATAC AACTATAACT 44 0 

CCCACAATGT GTACATCATG GCCGACAAGC AAAAGAATGG 480 

CATCAAGGTC AACTTCAAGA TCAGACACAA CATTGAGGAT 52 0 

GGATCCGTGC AGCTGGCCGA CCATTATCAA CAGAACACTC 560 

CAATCGGCGA CGGCCCTGTG CTCCTCCCAG ACAACCATTA 600 

CCTGTCCACC CAGTCTGCCC GTCTAAAGAT CCCAACGAAA 64 0 

AGAGAGACCA CATGGTCCTG CTGGAGTTTG TGACCGCTGC 68 0 

TGGGATCACA CATGGCATGG ACGAGCTGTA CAAGTGAGCC 720 

ATATGGTGAG TGGATGCCTT GACCCCAGGC GGGGATGGGG 7 60 

GAGACCTGTA GTCAGAGCCC CCGGGCAGCA CAGGCCAATG 800 

CCCGTCCTTC CCCTGCAGTG AGTAGTGACT GCCCGGGTGG 84 0 

GATCCCTGTG ACCCCTCCCC AGTGCCTCTC CTGGCCCTGG 880 

AAGTTGCCAC TCCAGTGCCC ACCAGCCTTG TCCTAATAAA 92 0 

ATTAAGTTGC ATCATTTTGT CTGACTAGGT GTCCTCTATA 960 

ATATTATAAG CTTGATATCG AATTCTTTCT CAACGTAACA 1000 

CTTTACAGCG GCGCGTCATT TGATATGATG CGCCCCGCTT 104 0 

CCCGATAAGG GAGCAGGCCA GT AAAAG CAT TACCCGTGGT 1080 

GGGGTTCCCG AGCGGCCAAA GGGAGCAGAC TCTAAATCTG 1120 

CCGTCATCGA CTTCGAAGGT TCGAATCCTT CCCCCACCAC 1160 

CATCACTTTC AAAAGTCCGA AAGAATTCCT GCAGCCCGTG 12 00 

TAGCCGTAGT TAGGCCACCA CTTCAAGAAC TCTGTAGCAC 124 0 

CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC 1280 

TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC 132 0 

TCAAGACGAT AGTTACCGGA TAAGGCGCAG CGGTCGGGCT 1360 

GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC 14 00 

GACCTACACC GAACTGAGAT ACCTACAGCG TGAGCATTGA 1440 

GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG GCGGACAGGT 1480 

ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG 152 0 

GGAGCTTCCA GGGGGAAACG CCTGGTATCT TTATAGTCCT 1560 

GTCGGGTTTC GCCACCTCTG ACTTGAGCGT CGATTTTTGT 1600 

GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG 164 0 

CAACGCGGCC GGGGGATCCG GAGAGCTCAC TCTAGATGAG 1680 

AG AG C AGTG A G G GAG AG AC A GAGACTCGAA TTTCCGGAGC 1720 

TATTTCAGTT TTCTTTTCCG TTTTGTGCAA TTTCACTTAT 17 60 

GATACCGGCC AATGCTTGGT TGCTATTTTG GAAACTCCCC 1800 

TTAGGGGATG CCCCTCAACT GGCCCTATAA AGGGCCAGCC 1840 

TGAGCTGCAG AGGATTCCTG CAGAGGATCA AGACAGCACG 18 8 0 

TGGACCTCGC ACAGCCTCTC CCACAGGTAC C 1911 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19; 
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Pro 


Asp 


Leu 


Ser 


Tyr 


Met 


Pro 


He 


Trp 


Lys 


Phe 


Pro 


1 






5 










10 






Asp 


Glu 


Glu 


Gly 


Ala 


Cys 


Gin 


Pro 


Cys 


Pro 


He 


Asn 




15 










20 








Gly 


Cys 


Thr 


His 


Ser 


Cys 


Val 


Asp 


Leu 


Asp 


Asp 


Lys 


25 










30 










35 




Cys 


Pro 


Ala 


Glu 


Gin 


Arg 


Ala 


Ser 


Pro 


Leu 


Thr 


Ser 






40 










45 






Val 


He 


He 


Ser 


Ala 


Val 


Val 


Gly 


He 


Leu 


Leu 


Val 


50 










55 










60 


Val 


Leu 


Gly 


Val 


Val 


Phe 


Gly 


He 


Leu 









65 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 287 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Pro Ala Pro Gly Ala Gly Gly Met Val His His Arg 

1 5 10 

His Arg Ser Ser Ser Thr Arg Ser Gly Gly Gly Asp 

15 20 
Leu Thr Leu Gly Leu Glu Pro Ser Glu Glu Glu Ala 

25 30 35 

Pro Arg Ser Pro Leu Ala Pro Ser Glu Gly Ala Gly 

40 45 
Ser Asp Val Phe Asp Gly Asp Leu Gly Met Gly Ala 
50 55 60 

Ala Lys Gly Leu Ser Leu Pro Thr His Asp Pro Ser 

65 70 
Pro Leu Gin Arg Tyr Ser Glu Asp Pro Thr Val Pro 

75 80 
Leu Pro Ser Glu Thr Asp Gly Tyr Val Ala Pro Leu 

85 90 95 

Thr Cys Ser Pro Gin Pro Glu Tyr Val Asn Gin Pro 

100 105 
Asp Val Arg Pro Pro Pro Ser Pro Arg Glu Gly Pro 
HO 115 120 

Leu Pro Ala Ala Arg Pro Ala Gly Ala Thr Leu Glu 

125 130 
Arg Pro Lys Thr Leu Ser Pro Gly Lys Asn Gly Val 
135 140 
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Val 


Lys 


Asp 


Val 


Phe 


Ala 


Phe 


Gly 


Gly 


Ala 


Val 


Glu 


145 








150 










155 




Asn 


Pro 


Glu 


Tyr 
160 


Leu 


Thr 


Pro 


Gin 


Gly 
165 


Thr 


Cys 


Ser 


Pro 


Gin 


Pro 


Glu 


Tyr 


Val 


Asn 


Gin 


Pro 


Asp 


Val 


Arg 


170 








175 










loO 


Pro 


Gin 


Pro 


Pro 


Ser 


Pro 


Arg 


Glu 


Gly 


Pro 


Leu 


Pro 








185 










190 






Ala 


Ala 


Arg 
195 


Pro 


Ala 


Gly 


Ala 


Thr 
200 


Leu 


Glu 


Arg 


Pro 


Lys 


Leu 


Ser 


Pro 


Gly 


Lys 


Asn 


Gly 


Val 


Val 


Lys 


Asp 


205 










210 










215 


Glu 


Val 


Phe 


Ala 


Phe 


Gly 


Gly 


Ala' Val 


Glu 


Asn 


Pro 








220 










225 








Tyr 


Leu 


Thr 


Pro 


Gin 


Gly 


Gly 


Ala 


Ala 


Pro 


Gin 


Pro 


230 










235 










240 


His 


Pro 


Pro 


Pro 


Ala 


Phe 


Ser 


Pro 


Ala 


Phe 


Asp 


Asn 








245 










250 






Leu 


Tyr 


Tyr 


Trp 


Asp 


Asp 


Pro 


Pro 


Glu 


Arg 


Gly 


Ala 




255 










260 










Pro 


Pro 


Ser 


Thr 


Phe 


Lys 


Gly 


Thr 


Pro 


Thr 


Ala 


Glu 


260 










270 










275 




Asn 


Pro 


Glu 


Tyr 
280 


Leu 


Gly 


Leu 


Asp 


Val 
285 


Pro 


Val 





(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 2 amino acids 

(B) TYPE: amino acid 

( D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 21: 



lie lie Ser Ala Val Val Gly He Leu Leu Val Val 
15 10 

Val Leu Gly Val Val Phe Gly He Leu He 
15 20 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



WO 98/06863 €^ ^>CT/US97/14306 



- 88 - 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GCCACCATGG CCCCTGACCT CTCCTACATG CCCATCTGGA 40 
AGTTTCCAGA TGAGGAGGGC GCATGCCAGC CTTGCCCCAT 80 

CAACTGCACC CACTCCTGTG TGGACCTGGA TGACAAGGGC 12 0 

TGCCCCGCCG AGCAGAGAGC CAGCCCTCTG ACGTCCATCA 160 

TCTCTGCGGT GGTTGGCATT CTG CTGGTCG TGGTCTTGGG 2 00 

GGTGGTCTTT GGGATCCTCA TCAAGCGACG GCAGCAGAAG 240 

ATCACATGTC CAGACCCTGC CCCGGGCGCT GGGGGCATGG 280 

TCCACCACAG GCACCGCAGC TCATCTACCA GGAGTGGCGG 3 20 

TGGGGACCTG ACACTAGGGC TGGAGCCCTC TGAAGAGGAG 3 60 

GCCCCCAGGT CTCCACTGGC ACCCTCCGAA GGGGCTGGCT 4 00 

CCGATGTATT TGATGGTGAC CTGGGAATGG GGGCAGCCAA 440 

GGGGCTGCAA AGCCTCCCCA CACATGACCC CAGCCCTCTA 4 80 

CAGCGGTACA GTGAGGACCC CACAGTACCC CTGCCCTCTG 520 

AGACTGATGG CTACGTTGCC CCCCTGACCT GCAGCCCCCA 560 

GCCTGAATAT GTGAACCAGC CAGATGTTCG GCCCCAGCCC 600 

CCTTCGCCCC GAGAGGGCCC TCTGCCTGCT GCCCGACCTG 640 
CTGGTGCCAC TCTGGAAAGG CCCAAGACTC TCTCCCCAGG 680 

GAAGAATGGG GTCGTCAAAG ACGTTTTTGC CTTTGGGGGT 720 
GCCGTGGAGA ACCCCGAGAC TTGACACCCC AGGGAGGAGC 7 60 

TGCCCCTCAG CCCCACCCTC CTCCTGCCTT CAGCCCAGCC 800 
TTCGACAACC TCTATTACTG GGACCAGGAC CCACCAGAGC 840 
GGGGGGCTCC ACCCAGCACC TTCAAAGGGA CACCTACGGC 880 
AGAGAACCCA GAGTACCTGG GTCTGGACGT GCCAGTGTGA 92 0 

AGCCTTAAGG GCCATATGGT GAGTGGATGC CTTGACCCCA 960 

GGCGGGGATG GGGGAGACCT GTAGTCAGAG CCCCCGGGCA 1000 

GCACAGGCCA ATGCCCGTCC TTCCCCTGCA GTGAGTAGTG 1040 

ACTGCCCGGG TGGGATCCCT GTGACCCCTC CCCAGTGCCT 1080 

CTCCTGGCCC TGGAAGTTGC CACTCCAGTG CCCACCAGCC 1120 

TTGTCCTAAT AAAATTAAGT TGCATCATTT TGT CTG ACT A 1160 

GGTGTCCTCT ATAATATTAT AAGCTTGATA TCGAATTCTT 1200 

TCTCAACGTA ACACTTTACA GCGGCGCGTC ATTTGATATG 1240 

ATGCGCCCCG CTTCCCGATA AGGGAGCAGG CCAGTAAAAG 1280 

CATTACCCGT GGTGGGGTTC CCGAGCGGCC AAAGGGAGCA 1320 

GACTCTAAAT CTGCCGTCAT CGACTTCGAA GGTTCGAATC 1360 

CTTCCCCCAC CACCATCACT TTCAAAAGTC CGAAAGAATT 14 00 

CCTGCAGCCC GTGTAGCCGT AGTTAGGCCA CCACTTCAAG 1440 

AACTCTGTAG CACCGCCTAC ATACCTCGCT CTG CTAATCC 1480 

TGTTACCAGT GGCTGCTGCC AGTGGCGATA AGTCGTGTCT 1520 

TAC CGGGTTG GACTCAAGAC GATAGTTACC GGATAAGGCG 1560 

CAGCGGTCGG GCTGAACGGG GGGTTCGTGC ACACAGCCCA 1600 

GCTTGGAGCG AACGACCTAC ACCGAACTGA GATACCTACA 1640 

GCGTGAGCAT TGAGAAAGCG CCACGCTTCC CGAAGGGAGA 1680 

AAGGCGGACA GGTATCCGGT AAGCGGCAGG GTCGGAACAG 1720 

GAGAGCGCAC GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA 17 60 
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TCTTTATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG 182 0 

CGTCGATTTT TGTGATGCTC GTCAGGGGGG CGGAGCCTAT I860 

GGAAAAACGC CAGCAACGCG GCCGGGGGAT CCGGAGAGCT 1900 

CACTCTAGAT GAGAGAGCAG TGAGGGAGAG ACAGAGACTC 194 0 

GAATTTCCGG AG CT ATTTC A GTTTTCTTTT CCGTTTTGTG 198 0 

CAATTTCACT TATGATACCG GCCAATGCTT GGTTGCTATT 2020 

TTGGAAACTC CCCTTAGGGG ATGCCCCTCA ACTGGCCCTA 2 060 

TAAAGGGCCA GCCTGAGCTG CAGAGGATTC CTGCAGAGGA 2100 

TCAAGACAGC ACGTGGACCT CGCACAGCCT CTCCCACAGG 214 0 

TACCT 2145 



(2) INFORMATION FOR SEQ ID NO: 23 



(i) SEQUENCE CHARACTERISTICS: 

2 (A) LENGTH: 27 base pairs 
fH (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
31 (D) TOPOLOGY: unknown 

HI 

(ii) MOLECULE TYPE: oligonucleotide 
i y (iii) HYPOTHETICAL: No 

!^ (iv) ANTI-SENSE: No 

P (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 23: 

03 GTCTGCCACC ATGGCCTACT CCCCTGC 27 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 24: 
TTCTTTGGTG ACCTACCTCT TCGGAATTGC CGAGTC 3 6 



(2) INFORMATION FOR SEQ ID NO: 25: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1242 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE: No 

(Xi) SEQUENCE DESCRIPTION :SEQ ID NO: 25: 

ATGGAGGAGC CGCAGTCAGA TCCTAGCGTC GAGCCCCCTC 40 

TGAGTCAGGA AAC ATTTT C A GACCTATGGA AACTACTTCC 80 

TGAAAACAAC GTTCTGTCCC CCTTGCCGTC CCAAGCAATG 120 

GATGATTTGA TGCTGTCCCC GGACGATATT GAACAATGGT 160 

TCACTGAAGA CCCAGGTCCA GATGAAGCTC CCAGAATGCC 200 

AGAGGCTGCT CCCCGCGTGG CCCCTGCACC AGCAGCTCCT 240 

ACACCGGCGG CCCCTGCACC AGCCCCCTCC TGGCCCCTGT 280 

CATCTTCTGT CCCTTCCCAG AAAACCTACC AGGGCAGCTA 320 

CGGTTTCCGT CTGGGCTTCT TGCATTCTGG GACAGCCAAG 3 60 

TCTGCCACCA TGGCCTACTC CCCTGCGTCT GTGACTTGCA 400 

CGTACTCCCC TGCCCTCAAC AAGATGTTTT GCCAACTGGC 440 

CAAGACCTGC CCTGTGCAGC TGTGGGTTGA TTCCACACCC 480 

CCGCCCGGCA CCCGCGTCCG CGCCATGGCC ATCTACAAGC 520 

AGTCACAGCA CATGACGGAG GTTGTGAGGC GCTGCCCCCA 560 

CCATGAGCGC TGCTCAGATA GCGATGGTCT GGCCCCTCCT 600 

CAGCGTCTTA TCCGAGTGGA AGGAAATTTG CGTGTGGAGT 64 0 

ATTTGGATGA CAGAAACACT TTTCGACATA GTGTGGTGGT 680 

GCCCTATGAG CCGCCTGAGG TTGGCTCTGA CTGTACCACC 720 

ATCCACTACA ACTACATGTG TAACAGTTCC TGCATGGGCG 760 

GCATGAACCG GAGGCCCATC CTCACCATCA TCACACTGGA 800 

AGACTCCAGT GGTAATCTAC TGGGACGGAA CAGCTTTGAG 84 0 

GTGCGTGTTT GTGCCTGTCC TGGGAGAGAC CGGCGCACAG 880 

AGGAAGAGAA TCTCCGCAAG AAAGGGGAGC CTCACCACGA 920 

GCTGCCCCCA GGGAGCACTA AG CG AG C ACT GCCCAACAAC 960 

ACCAGCTCCT CTCCCCAGCC AAAGAAGAAA CCACTGGATG 1000 

GAGAATATTT CACCCTTCAG ATCCGTGGGC GTGAGCGCTT 104 0 

CGAGATGTTC TTTGGTGACC TACCTCTTCG GAATTGCCGA 1080 

GTCTTCCGAG AG CTG AATGA GGCCTTGGAA CTCAAGGATG 112 0 

CCCAGGCTGG GAAGGAGCCA GGGGGGAGCA GGGCTCACTC 1160 

CAGCCACCTG AAGTCCAAAA AGGGTCAGTC TACCTCCCGC 1200 

CATAAAAAAC TCATGTTCAA GACAGAAGGG CCTGACTCAG 124 0 

AC 1242 



(2) INFORMATION FOR SEQ ID NO: 26: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 608 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 26: 



CTCGGGCCGC GTTGCTGGCG TTTTTCCATA GGCTCCGCCC 
CCCTGACGAG CATCACAAAA ATCGACGCTC AAGTCAGAGG 
TGGCGAAACC CGACAGGACT ATAAAGATAC CAGGCGTTTC 
CCCCTGGAAG CTCCCTCGTG CGCTCTCCTG TTCCGACCCT 
GCCGCTTACC GGATACCTGT CCGCCTTTCT CCCTTCGGGA 
AGCGTGGCGC TTTCTCAATG CTCACGCTGT AGGTATCTCA 
GTTCGGTGTA GGTCGTTCGC TCCAAGCTGG GCTGTGTGCA 
CGAACCCCCC GTTCAGCCCG ACCGCTGCGC CTTATCCGGT 
AACTATCGTC TTGAGTCCAA CCCGGTAAGA CACGACTTAT 
CGCCACTGGC AGCAGCCACT GGTAACAGGA TT AG C AG AG C 
GAGGTATGTA GGCGGTGCTA CAGAGTTCTT GAAGTGGTGG 
CCTAACTACG GCTACACTAG AAGGACAGTA TTTGGTATCT 
GCGCTCTGCT GAAGCCAGTT ACCTTCGGAA AAAGAGTTGG 
TAGCTCTTGA TCCGGCAAAC AAACCACCGC TGGTAGCGGT 
GGTTTTTTTG TTTGCAAGCA GCAGATTACG CG C AG AAAAA 
AAGGATCT 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1547 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL : No 

( iv) ANTI-SENSE : No 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 



GGTACCTGCC ACCATGGCGC GGATTCTTTA TCACTGATAA 
GTTGGTGGAC ATATTATGTT TATCAGTGAT AAAGTGTCAA 
GCATGACAAA GTTGCAGCCG AATACAGTGA TCCGTGCCGG 
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CCCTGGACTG TTGAACGAGG TCGGCGTAGA CGGTCTGACG 160 

ACACGCAAAC TGGCGGAACG GTTGGGGGTG CAGCAGCCGG 200 

CGCTTTACTG GCACTTCAGG AACAAGCGGG CGCCTTAAGG 240 

GCCATATGGT GAGTGGATGC CTTGACCCCA GGCGGGGATG 280 

GGGGAGACCT GTAGTCAGAG CCCCCGGGCA GCACAGGCCA 320 

ATGCCCGTCC TTCCCCTGCA GGATGAGTAG TGAGTGCCTC 360 

TCCTGGCCCT GGAAGTTGCC ACTCCAGTGC CCACCAGCCT 400 

TGTCCTAATA AAATTAAGTT GCATCATTTT GTCTGACTAG 44 0 

GTGTCCTCTA TAATATTATA AG CTTG AT AT CGAATTCTTT 480 

CGGACTTTTG AAAGTGATGG TGGTGGGGGA AGGATTCGAA 520 

CCTTCGAAGT CGATGACGGC AGATTTAGAG TCTGCTCCCT 560 

TTGGCCGCTC GGGAACCCCA CCACGGGTAA TGCTTTTACT 600 

GGCCTGCTCC CTTATCGGGA AGCGGGGCGC ATCATATCAA 64 0 

ATGACGCGCC GCTGTAAAGT GTTACGTTGA GAAAGAATTC 680 

CTGCAGCCCG CCGCGTTGCT GGCGTTTTTC CATAGGCTCC 7 20 

GCCCCCCTGA CGAGCATCAC AAAAATCGAC GCTCAAGTCA 760 

GAGGTGGCGA AACCCGACAG GACTATAAAG ATACCAGGCG 800 

TTTCCCCCTG GAAGCTCCCT CGTGCGCTCT CCTGTTCCGA 840 

CCCTGCCGCT TACCGGATAC CTGTCCGCCT TTCTCCCTTC 880 

GGGAAGCGTG GCGCTTTCTC AATGCTCACG CTGTAGGTAT 920 

CTCAGTTCGG TGTAGGTCGT TCGCTCCAAG CTGGGCTGTG 960 

TGCACGAACC CCCCGTTCAG CCCGACCGCT GCGCCTTATC 1000 

CGGTAACTAT CGTCTTGAGT CCAACCCGGT AAGACACGAC 104 0 

TTATCGCCAC TGGCAGCAGC CACTGGTAAC AGGATTAGCA 1080 

GAGCGAGGTA TGTAGGCGGT GCTACAGAGT TCTTGAAGTG 112 0 

GTGGCCTAAC TACGGCTACA CTAGAAGGAC AGTATTTGGT 1160 

ATCTGCGCTC TGCTGAAGCC AGTTACCTTC GGAAAAAGAG 12 00 

TTGGTAGCTC TTGATCCGGC AAACAAACCA CCGCTGGTAG 1240 

CGGTGGTTTT TTTGTTTGCA AGCAGCAGAT TACGCGCAGA 1280 

AAAAAAGGAT CTGGGGGATC CGGAGAGCTC ACTCTAGATG 1320 

AGAGAGCAGT GAGGGAGAGA CAGAGACTCG AATTTCCGGA 13 60 

GCTATTTCAG TTTTCTTTTC CGTTTTGTGC AATTTCACTT 1400 

ATGATACCGG CCAATGCTTG GTTGCTATTT TGGAAACTCC 144 0 

CCTTAGGGGA TGCCCCTCAA CTGGCCCTAT AAAGGGCCAG 1480 

CCTGAGCTGC AGAGGATTCC TG C AG AGG AT CAAGACAGCA 1520 

CGTGGACCTC GCACAGCCTC TCCCACA 1547 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
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GGTACCTGCC ACCATGGCGC GGATTCTTTA TCACTGATAA 40 

GTTGGTGGAC ATATTATGTT TATCAGTGAT AAAGTGTCAA 80 

GCATGACAAA GTTGCAGCCG AATACAGTGA TCCGTGCCGG 120 

CCCTGGACTG TTGAACGAGG TCGGCGTAGA CGGTCTGACG 160 

ACACGCAAAC TGGCGGAACG GTTGGGGGTG CAGCAGCCGG 200 

CGCTTTACTG GCACTTCAGG AACAAGCGGG CGCCTTAAGG 240 

GCCATATGGT GAGTGGATGC CTTGACCCCA GGCGGGGATG 280 

GGGGAGACCT GTAGTCAGAG CCCCCGGGCA GCACAGGCCA 320 

ATGCCCGTCC TTCCCCTGCA GGATGAGTAG TGAGTGCCTC 360 

TCCTGGCCCT GGAAGTTGCC ACTCCAGTGC CCACCAGCCT 400 

TGTCCTAATA AAATTAAGTT GCATCATTTT GTCTGACTAG 440 

GTGTCCTCTA TAATATTATA AG CTTG AT AT CGAATTCTTT 480 

CGGACTTTTG AAAGTGATGG TGGTGGGGGA AGGATTCGAA 520 

CCTTCGAAGT CGATGACGGC AGATTTAGAG TCTGCTCCCT 560 

TTGGCCGCTC GGGAACCCCA CCACGGGTAA TGCTTTTACT 600 

GGCCTGCTCC CTTATCGGGA AGCGGGGCGC ATCATATCAA 640 

ATGACGCGCC GCTGTAAAGT GTTACGTTGA GAAAGAATTC 680 

CTGCAGCCCG CCGCGTTGCT GGCGTTTTTC CATAGGCTCC 720 

GCCCCCCTGA CGAGCATCAC AAAAATCGAC GCTCAAGTCA 760 

GAGGTGGCGA AACCCGACAG GACTATAAAG ATACCAGGCG 800 

TTTCCCCCTG GAAGCTCCCT CGTGCGCTCT CCTGTTCCGA 840 

CCCTGCCGCT TACCGGATAC CTGTCCGCCT TTCTCCCTTC 88 0 

GGGAAGCGTG GCGCTTTCTC AATGCTCACG CTGTAGGTAT 92 0 

CTCAGTTCGG TGTAGGTCGT TCGCTCCAAG CTGGGCTGTG 960 

TGCACGAACC CCCCGTTCAG CCCGACCGCT GCGCCTTATC 1000 

CGGTAACTAT CGTCTTGAGT CCAACCCGGT AAGACACGAC 104 0 

TTATCG C C AC TGGCAGCAGC CACTGGTAAC AGGATTAGCA 1080 

GAGCGAGGTA TGTAGGCGGT GCTACAGAGT TCTTGAAGTG 1120 

GTGGCCTAAC TACGGCTACA CTAGAAGGAC AGTATTTGGT 1160 

ATCTGCGCTC TGCTGAAGCC AGTTAC CTTC GGAAAAAGAG 1200 

TTGGTAGCTC TTGATCCGGC AAACAAACCA CCGCTGGTAG 124 0 

CGGTGGTTTT TTTGTTTGCA AG C AGC AG AT TACGCGCAGA 1280 

AAAAAAGGAT CTGGGGGATC CGGAGAGCTC CCAACGCGTT 132 0 

GGATGCATGG ATGAGGGAAA GGAGGTAAGA TCTGTAATGA 1400 

ATAAGCAGGA ACTTTGAAGA CTCAGTGACT CAGTGAGTAA 1440 

TAAAGACTCA GTGACTTCTG ATCCTGTCCT AACTGCCACT 1480 

CCTTGTTGTC CCAAGAAAGC GGCTTCCTGC TCTCTGAGGA 152 0 

GGACCCCTTC CCTGGAAGGT AAAACTAAGG ATGTCAG CAG 1560 

AGAAATTTTT CCACCATTGG TGCTTGGTCA AAGAGGAAAC 1600 

TGATG AG CTC ACTCTAGATG AGAGAGCAGT GAGGGAGAGA 164 0 

CAGAGACTCG AATTTC CGG A GCTATTTCAG TTTTCTTTTC 1680 

CGTTTTGTGC AATTTCACTT ATGATACCGG CCAATGCTTG 172 0 

GTTGCTATTT TGGAAACTCC CCTTAGGGGA TGCCCCTCAA 1760 

CTGGCCCTAT AAAGGGCCAG CCTGAGCTGC AGAGGATTCC 1800 

TGCAGAGGAT CAAGACAGCA CGTGGACCTC GCACAGCCTC 184 0 

TCCCACA 1847 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 2308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:29: 



GGTACCTGCC ACCATGGCGA AGGGCGAGGA ACTGTTCACT 4 0 

GGCGTGGTCC CAATTCTCGT GGAACTGGAT GGCGATGTGA 80 

ATGGGCACAA ATTTTCTGTC AGCGGAGAGG GTGAAGGTGA 120 

TGCCACATAC GGAAAGCTCA CCCTGAAATT CATCTGCACC 160 

ACTGGAAAGC TCCCTGTGCC ATGGCCAACA CTGGTCACTA 2 00 

CCTTCACCTA TGGCGTGCAG TGCTTTTCCA GATACCCAGA 24 0 

CCATATGAAG CAGCATGACT TTTTCAAGAG CGCCATGCCC 280 

GAGGGCTATG TG CAGG AG AG AACCATCTTT TTCAAAGATG 32 0 

ACGGGAACTA CAAGACCCGC GCTGAAGTCA AGTTCGAAGG 360 

TGACACCCTG GTGAATAGAA TCGAGTTGAA GGGCATTGAC 400 

TTTAAGGAAG ATGGAAACAT TCTCGGCCAC AAGCTGGAAT 44 0 

ACAACTATAA CTCCCACAAT GTGTACATCA TGGCCGACAA 480 

GCAAAAGAAT GGCATCAAGG TCAACTTCAA GATCAGACAC 520 

AACATTGAGG ATGGATCCGT GCAGCTGGCC GACCATTATC 560 

AACAGAACAC TCCAATCGGC GACGGCCCTG TGCTCCTCCC 600 

AGACAACCAT TACCTGTCCA CCCAGTCTGC CCTGTCTAAA 64 0 

GATCCCAACG AAAAGAGAGA CCACATGGTC CTGCTGGAGT 68 0 

TTGTGACCGC TGCTGGGATC ACACATGGCA TGGACGAGCT 720 

GTACAAGTGA GCGCCTTAAG GGCCATATGG TGAGTGGATG 7 60 

CCTTGACCCC AGGCGGGGAT GGGGGAGACC TGTAGTCAGA 800 

GCCCCCGGGC AGCACAGGCC AATGCCCGTC CTTCCCCTGC 84 0 

AGGATGAGTA GTGAGTGCCT CTCCTGGCCC TGGAAGTTGC 880 

CACTCCAGTG CCCACCAGCC TTGTCCTAAT AAAATTAAGT 920 

TGCATCATTT TGTCTGACTA GGTGTCCTCT ATAATATTAT 960 

AAGCTTGATA TCGAATTCTT TCGGACTTTT GAAAGTGATG 1000 

GTGGTGGGGG AAGGATTCGA ACCTTCGAAG TCGATGACGG 104 0 

CAGATTTAGA GTCTGCTCCC TTTGGCCGCT CGGGAACCCC 1080 

ACCACGGGTA ATGCTTTTAC TGGCCTGCTC CCTTATCGGG 112 0 

AAGCGGGGCG CATCATATCA AATGACGCGC CGCTGTAAAG 1160 

TGTTACGTTG AGAAAGAATT CCTGCAGCCC GCCGCGTTGC 1200 

TGGCGTTTTT CCATAGG CTC CGCCCCCCTG ACGAGCATCA 124 0 

CAAAAATCGA CGCTCAAGTC AG AGGTGG CG AAACCCGACA 1280 

GGACTATAAA GATACCAGGC GTTTCCCCCT GGAAGCTCCC 132 0 

TCGTGCGCTC TCCTGTTCCG ACCCTGCCGC TTACCGGATA 1360 

CCTGTCCGCC TTTCTCCCTT CGGGAAGCGT GGCGCTTTCT 1400 

CAATGCTCAC GCTGTAGGTA TCTCAGTTCG GTGTAGGTCG 144 0 

TTCG CTCCAA GCTGGGCTGT GTGCACGAAC CCCCCGTTCA 1480 

GCCCGACCGC TGCGCCTTAT CCGGTAACTA TCGTCTTGAG 152 0 

TCCAACCCGG TAAGACACGA CTTATCGCCA CTGGCAGCAG 1560 
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CCACTGGTAA C AGG ATT AG C AGAGCGAGGT ATGTAGGCGG 1600 

TGCTACAGAG TTCTTGAAGT GGTGGCCTAA CTACGGCTAC 164 0 

ACTAGAAGGA CAGTATTTGG TATCTGCGCT CTGCTGAAGC 1680 

CAGTTACCTT CGGAAAAAGA GTTGGTAGCT CTTGATCCGG 172 0 

CAAACAAACC ACCGCTGGTA GCGGTGGTTT TTTTGTTTGC 17 60 

AAGCAGCAGA TTACGCGCAG AAAAAAAGGA TCTGGGGGAT 1800 

CCGGAGAGCT CCCAACGCGT TGGATGCATG GATGAGGGAA 184 0 

AGGAGGTAAG ATCTGTAATG AATAAGCAGG AACTTTGAAG 1880 

ACTCAGTGAC TCAGTGAGTA ATAAAGACTC AGTGACTTCT 1920 

GATCCTGTCC TAACTGCCAC TCCTTGTTGT CCCAAGAAAG 1960 

CGGCTTCCTG CTCTCTGAGG AGGACCCCTT CCCTGGAAGG 2 000 

TAAAACTAAG GATGTCAGCA GAGAAATTTT TCCACCATTG 204 0 

GTGCTTGGTC AAAGAGGAAA CTGATGAGCT CACTCTAGAT 2080 

GAG AG AG C AG TGAGGGAGAG ACAGAGACTC GAATTTCCGG 212 0 

AG CT ATTTC A GTTTTCTTTT CCGTTTTGTG CAATTTCACT 2160 

TATGATACCG GCCAATGCTT GGTTG CTATT TTGGAAACTC 2 2 00 

CCCTTAGGGG ATGCCCCTCA ACTGGCCCTA TAAAGGGCCA 224 0 

GCCTGAGCTG CAGAGGATTC CTGCAGAGGA TCAAGACAGC 2 280 

ACGTGGACCT CGCACAGCCT CTCCCACA 2 3 08 



